Cartoon-recognition using video & audio descriptors

نویسندگان

Ronald Glasberg

Amjad Samour

Khalid Elazouzi

Thomas Sikora

چکیده

We present a new approach for classifying mpeg-2 video sequences as ‘cartoon’ or ‘non-cartoon’ by analyzing specific video and audio features of consecutive frames in real-time. This is part of the well-known video-genreclassification problem, where popular TV-broadcast genres like cartoon, commercial, music, news and sports are studied. Such applications have also been discussed in the context of MPEG-7 [12]. In our method the extracted features from the visual descriptors are non-linearly combined using a multilayered perceptron and then considered together with the output of the audio-descriptor to produce a reliable recognition. The results demonstrate a high identification rate based on a large collection of 100 representative video sequences (20 cartoons and 4*20 noncartoons) gathered from free digital TV-broadcasting.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

متن کامل

Cartoon-recognition Using Visual-descriptors and a Multilayer-percetpron

We present a new approach for classifying mpeg-2 video sequences as ‘cartoon’ or ‘non-cartoon’ by analyzing specific color, texture and motion features of consecutive frames in real-time. This is part of the well-known videogenre-classification problem, where popular TVbroadcast genres like cartoon, commercial, music, news and sports are studied. Such applications have also been discussed in th...

متن کامل

Real-Time Approaches for Video-Genre-Classification using New High-Level Descriptors and a Set of Classifiers

In this paper we describe in detail the recent publications related to video-genre-classification and present our improved approaches for classifying video sequences in real-time as ‘cartoon’, ‘commercial’, ‘music’, ‘news’ or ‘sport’ by analyzing the content with high-level audio-visual descriptors and classification methods. Such applications have also been discussed in the context of MPEG-7 [...

متن کامل

MPEG-7 sound-recognition tools

The MPEG-7 sound-recognition Descriptors and Description Schemes consist of tools for indexing audio media using probabilistic sound models. The Descriptors provide containers for category labels, as well as data structures for quantitative information about sound content. We describe the normative tools, as well as informative methods, for automatic description extraction and sound matching.

متن کامل

A System Architecture for Multilingual Spoken Document Retrieval

Finding audio and video resources in internet is becoming an increasingly demanded application. However, search engines are usually limited to adjacent texts (hand supplied transcripts or close captions) to index and classify multimedia documents. Clearly, a key advantage can be taken from using automatic speech recognition and natural language processing technologies, since they allow to trans...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Cartoon-recognition using video & audio descriptors

نویسندگان

چکیده

منابع مشابه

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Cartoon-recognition Using Visual-descriptors and a Multilayer-percetpron

Real-Time Approaches for Video-Genre-Classification using New High-Level Descriptors and a Set of Classifiers

MPEG-7 sound-recognition tools

A System Architecture for Multilingual Spoken Document Retrieval

عنوان ژورنال:

اشتراک گذاری